Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
2969 | 36202 | 1,5 |
4337 | 23978 | 2,5 |
5083 | 19885 | 1,2 |
5807 | 17098 | 3,5 |
5911 | 16708 | 1,3 |
6500 | 14963 | 1,6 |
6527 | 14896 | 1,4 |
6808 | 14210 | 1,8 |
7100 | 13442 | 1,1 |
7429 | 12752 | 1,7 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
270557 | 101 | Bis(s) zum Ende der Nacht |
298012 | 87 | (T)Raumschiff Surprise |
360135 | 65 | Breaking Dawn - Bis(s) zum Ende der Nacht |
431777 | 49 | Coop Himmelb(l)au |
483315 | 41 | Bis(s) zum Abendrot |
524707 | 36 | Bis(s) zum Morgengrauen |
601875 | 29 | Bis(s) zur Mittagsstunde |
603029 | 29 | Eclipse - Bis(s) zum Abendrot |
724824 | 22 | Museum für Energiegeschichte(n) |
869626 | 16 | (I Can't Get No) Satisfaction |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
267015 | 103 | .) |
270557 | 101 | Bis(s) zum Ende der Nacht |
298012 | 87 | (T)Raumschiff Surprise |
360135 | 65 | Breaking Dawn - Bis(s) zum Ende der Nacht |
431777 | 49 | Coop Himmelb(l)au |
483315 | 41 | Bis(s) zum Abendrot |
524707 | 36 | Bis(s) zum Morgengrauen |
601875 | 29 | Bis(s) zur Mittagsstunde |
603029 | 29 | Eclipse - Bis(s) zum Abendrot |
659157 | 25 | .“) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
16369 | 4860 | 10% |
20966 | 3594 | 20% |
21484 | 3490 | 50% |
22086 | 3370 | 5% |
22359 | 3320 | 100% |
27576 | 2537 | 30% |
27972 | 2491 | 2% |
29070 | 2365 | 1% |
31040 | 2174 | 3% |
31264 | 2153 | 15% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
12036 | 7134 | S&P |
14334 | 5728 | & Co |
21579 | 3473 | H&M |
22773 | 3243 | S&P 500 |
32913 | 2014 | GmbH & Co. KG |
34374 | 1900 | Standard & Poor's |
36811 | 1734 | AT&T |
48571 | 1194 | Ernst & Young |
49354 | 1168 | Heckler & Koch |
49786 | 1154 | C&A |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
275765 | 98 | A$AP |
340017 | 71 | A$AP Rocky |
485445 | 41 | Ke$ha |
1422038 | 8 | Wall$treet |
1550278 | 7 | Ty Dolla $ign |
1653490 | 6 | Ke$has |
1764101 | 5 | 50$-Marke |
1890482 | 5 | Mrd. $. |
2029706 | 4 | Ar$ch |
2059695 | 4 | CLa$$ic |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
118 | 934603 | ." |
301914 | 86 | Toys "R" Us |
819046 | 18 | Hochschule für Schauspielkunst "Ernst Busch" |
1269802 | 9 | Hochschule für Musik "Hanns Eisler" |
1410923 | 8 | Stiftung "Erinnerung, Verantwortung und Zukunft" |
1839844 | 5 | Gymnasium "In der Wüste" |
1854660 | 5 | Jagdgeschwader 71 "Richthofen" |
2146448 | 4 | Jagdbombergeschwader 31 "Boelcke" |
2273797 | 4 | Stanley "Tookie" Williams |
2364388 | 3 | "Love and Theft" |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
8010 | 11596 | gibt's |
8108 | 11431 | geht's |
13197 | 6340 | Let's |
14975 | 5424 | Let's Dance |
19817 | 3850 | McDonald's |
22038 | 3381 | Moody's |
22273 | 3336 | 10'000 |
22599 | 3276 | .' |
23082 | 3187 | 100'000 |
23153 | 3173 | gab's |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
23220 | 3161 | K+S |
54194 | 1030 | 50+1-Regel |
54195 | 1030 | 90.+1 |
55545 | 996 | 90.+2 |
62139 | 855 | 90.+3 |
66069 | 787 | Gruner + Jahr |
71161 | 712 | Google + |
72114 | 699 | 45.+1 |
96895 | 461 | 90.+4 |
100752 | 436 | 50+1 |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
364341 | 64 | Get the F*ck out of my House |
653788 | 26 | Sagittarius A* |
1699318 | 6 | Sag A* |
1932859 | 5 | Sgr A* |
3495023 | 2 | I’m Not a F**king Princess |
3897889 | 2 | Schwules Museum* |
5026249 | 1 | Berufsverband Bildender Künstler*innen Berlin |
5815125 | 1 | Family *5 |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
3720 | 28548 | km/h |
4552 | 22669 | dpa/tmn |
6426 | 15218 | Frankfurt/Main |
13381 | 6231 | CDU/CSU |
14508 | 5644 | awp/sda |
15045 | 5392 | dpa/lnw |
15185 | 5330 | und/oder |
15892 | 5037 | 2017/18 |
16617 | 4770 | 2018/19 |
17286 | 4552 | 90/Die |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _

Depending on the language, some of these characters may be allowed within words; others will not be. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
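
The check described above can be sketched in a few lines of Python. This is only an illustration: the set of forbidden characters (`FORBIDDEN`) and the frequency cut-off (`MAX_FREQ`) are hypothetical values chosen for the example, since both depend on the language and the corpus size. The sample rows are copied from the tables in this section.

```python
from __future__ import annotations

from typing import Iterable

# Hypothetical choices for illustration only: which characters count as
# forbidden and what "very low frequency" means depend on the language
# and on the size of the corpus.
FORBIDDEN = {'"', "$"}
MAX_FREQ = 1000


def flag_suspicious(
    rows: Iterable[tuple[int, int, str]],
    forbidden: set[str] = FORBIDDEN,
    max_freq: int = MAX_FREQ,
) -> list[tuple[int, int, str]]:
    """Return the (rank, freq, word) rows whose word contains a forbidden
    character and whose frequency is above max_freq."""
    return [
        (rank, freq, word)
        for rank, freq, word in rows
        if freq > max_freq and any(ch in forbidden for ch in word)
    ]


# A few rows copied from the tables in this section.
sample = [
    (118, 934603, '."'),
    (301914, 86, 'Toys "R" Us'),
    (275765, 98, "A$AP"),
    (3720, 28548, "km/h"),
]

flagged = flag_suspicious(sample)
print(flagged)
```

With these assumed settings only the very frequent entry `."` is reported; rare proper names such as `Toys "R" Us` or `A$AP` stay below the cut-off and are not flagged.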
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
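
The same query can be generated for every character in the list. The sketch below is a minimal illustration, not the pipeline actually used for this report: it runs against an in-memory SQLite database rather than MySQL, assumes only the `words(w_id, word, freq)` columns that appear in the query above, and adds an `ORDER BY w_id` so the output is deterministic. The helper names `like_pattern` and `top_words` are hypothetical. Note that `%` and `_` are wildcards in `LIKE`, so both have to be escaped; SQLite needs an explicit `ESCAPE` clause for that.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]

# Same shape as the query above, with a bound pattern, an explicit ESCAPE
# clause (required by SQLite) and an ORDER BY added for determinism.
QUERY = (
    "SELECT w_id - 100, freq, word FROM words "
    "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
    "ORDER BY w_id LIMIT 10"
)


def like_pattern(ch: str) -> str:
    """Build a LIKE pattern that matches any word containing ch.
    The LIKE wildcards % and _ (and the escape character itself)
    are prefixed with a backslash so they match literally."""
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return f"%{ch}%"


def top_words(conn: sqlite3.Connection, ch: str) -> list:
    """Return up to ten (rank, freq, word) rows for words containing ch."""
    return conn.execute(QUERY, (like_pattern(ch),)).fetchall()


# Toy database: the schema is an assumption derived from the query above.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (7, "%", 0),            # hypothetical row with w_id <= 100, skipped by the query
        (3820, "km/h", 28548),  # rows below are taken from the tables (w_id = rank + 100)
        (12136, "S&P", 7134),
        (16469, "10%", 4860),
        (21066, "20%", 3594),
    ],
)

results = {ch: top_words(conn, ch) for ch in SPECIAL_CHARS}
for ch, rows in results.items():
    print(ch, rows)
```

Without the escaping step, the pattern for `_` would match every non-empty word, and the table for that character would simply repeat the top of the word list.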